AITopics | intermediate step

a2802cade04644083dcde1c8c483ed9a-Paper.pdf

Neural Information Processing Systems | Apr-26-2026, 18:13:53 GMT

artificial intelligence, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Neural Information Processing Systems | Feb-19-2026, 03:53:29 GMT

Language models are increasingly being deployed for general problem solving across a wide range of tasks, but are still confined to token-level, left-to-right decision-making processes during inference.

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.68)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

f25602918e8a0d0c86e3c752ecfbbaa1-Supplemental-Conference.pdf

Neural Information Processing Systems | Feb-17-2026, 22:22:34 GMT

artificial intelligence, dataset, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > China > Shaanxi Province > Xi'an (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.30)

Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning

Neural Information Processing Systems | Feb-11-2026, 10:05:58 GMT

Recently, large language models (LLMs) (e.g., GPT-3 and ChatGPT) have shown remarkable zero-shot and few-shot performance on various tasks [

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Asia > China > Beijing > Beijing (0.04)
Europe > Portugal > Lisbon > Lisbon (0.04)

Genre:

Research Report (0.68)
Workflow (0.46)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

a2802cade04644083dcde1c8c483ed9a-Paper.pdf

Neural Information Processing Systems | Feb-10-2026, 09:30:45 GMT

algorithm, graph, systematic generalisation, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Massachusetts (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)

Why think step by step? Reasoning emerges from the locality of experience

Neural Information Processing Systems | Dec-27-2025

Humans have a powerful and mysterious capacity to reason. Working through a set of mental steps enables us to make inferences we would not be capable of making directly even though we get no additional data from the world. Similarly, when large language models generate intermediate steps (a chain of thought) before answering a question, they often produce better answers than they would directly. We investigate why and how chain-of-thought reasoning is useful in language models, testing the hypothesis that reasoning is effective when training data consists of overlapping local clusters of variables that influence each other strongly. These training conditions enable the chaining of accurate local inferences to estimate relationships between variables that were not seen together in training.
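The idea of chaining accurate local inferences can be illustrated with a minimal sketch. It assumes a three-variable chain A -> B -> C of binary variables in which C depends on A only through B, and in which training data contained (A, B) pairs and (B, C) pairs but never A and C together. The probability tables and the function name `chained_estimate` are invented for illustration and are not taken from the paper's experiments.

```python
from typing import Dict

# P(B=b | A=a), as might be estimated from samples that contain only A and B.
# The numbers are illustrative assumptions.
P_B_GIVEN_A: Dict[int, Dict[int, float]] = {
    0: {0: 0.9, 1: 0.1},
    1: {0: 0.2, 1: 0.8},
}

# P(C=1 | B=b), as might be estimated from samples that contain only B and C.
# The numbers are illustrative assumptions.
P_C1_GIVEN_B: Dict[int, float] = {0: 0.1, 1: 0.7}


def chained_estimate(a: int) -> float:
    """Estimate P(C=1 | A=a) by summing over the intermediate variable B.

    Valid under the assumption that C is independent of A given B.
    """
    return sum(P_B_GIVEN_A[a][b] * P_C1_GIVEN_B[b] for b in (0, 1))


if __name__ == "__main__":
    for a in (0, 1):
        print(f"P(C=1 | A={a}) ~ {chained_estimate(a):.2f}")
```

Under these assumed tables the chained estimates are about 0.16 for A=0 and 0.58 for A=1, even though no sample pairs A with C directly; the intermediate variable B plays the role of the reasoning step.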

locality, name change, reasoning emerge, (7 more...)

Neural Information Processing Systems

Genre: Research Report (0.39)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.87)
Information Technology > Artificial Intelligence > Machine Learning (0.81)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.35)

Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning

Neural Information Processing Systems | Dec-25-2025, 01:04:05 GMT

Chain-of-thought prompting (CoT) and tool augmentation have been validated in recent work as effective practices for improving the ability of large language models (LLMs) to perform step-by-step reasoning on complex math-related tasks. However, most existing math reasoning datasets may not be able to fully evaluate and analyze the ability of LLMs in manipulating tools and performing reasoning, as they often only require very few invocations of tools or miss annotations for evaluating intermediate reasoning steps, thus supporting only outcome evaluation. To address the issue, we construct CARP

name change, proceedings, tool-augmented computation-intensive math reasoning, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.84)

Smaller Models, Smarter Rewards: A Two-Sided Approach to Process and Outcome Rewards

Groeneveld, Jan Niklas, Qin, Xi, Schaefer, Alexander, Oren, Yaad

arXiv.org Artificial Intelligence | Dec-11-2025

Generating high-quality code remains a challenge for Large Language Models (LLMs). For the evolution of reasoning models on this task, reward models are a necessary intermediate step. These models judge outcomes or intermediate steps. Decoder-only transformer models can be turned into reward models by introducing a regression layer and supervised fine-tuning. While it is known that reflection capabilities generally increase with the size of a model, we want to investigate whether state-of-the-art small language models like the Phi-4 family can be turned into usable reward models blending the consideration of process rewards and outcome rewards. Targeting this goal, we construct a dataset of code samples with correctness labels derived from the APPS coding challenge benchmark. We then train a value-head model to estimate the success probability of intermediate outputs. Our evaluation shows that small LLMs are capable of serving as effective reward models or code evaluation critics, successfully identifying correct solutions among multiple candidates. Using this critic, we achieve over a 20% improvement in the search capability of the most accurate code out of multiple generations.
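A rough sketch of how a value head can be used to pick one solution among several candidates follows. It assumes the regression layer is a single linear map over a final hidden state followed by a sigmoid (the abstract does not specify this), and the hidden states, weights, and the names `value_head` and `best_of_n` are hypothetical stand-ins rather than the authors' implementation.

```python
import math
from typing import Dict, Sequence


def value_head(hidden: Sequence[float], weights: Sequence[float], bias: float) -> float:
    """Map a hidden state to a success probability (assumed: linear layer + sigmoid)."""
    logit = sum(h * w for h, w in zip(hidden, weights)) + bias
    return 1.0 / (1.0 + math.exp(-logit))


def best_of_n(hidden_states: Dict[str, Sequence[float]],
              weights: Sequence[float], bias: float) -> str:
    """Return the candidate whose hidden state receives the highest critic score."""
    return max(hidden_states, key=lambda name: value_head(hidden_states[name], weights, bias))


if __name__ == "__main__":
    # Toy hidden states for three generated programs (illustrative values only).
    candidates = {"a": [0.0, 1.0], "b": [2.0, 0.0], "c": [0.5, 0.5]}
    weights, bias = [1.0, -1.0], 0.0  # assumed to come from supervised fine-tuning
    for name, hidden in candidates.items():
        print(name, round(value_head(hidden, weights, bias), 3))
    print("selected:", best_of_n(candidates, weights, bias))
```

With these toy values, candidate "b" gets the largest logit (2.0) and is selected. In a real setup the hidden state would come from the fine-tuned language model and the score would rank actual code generations.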

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2510.23083

Country: North America > United States > California (0.46)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Efficiently Learning Branching Networks for Multitask Algorithmic Reasoning

Li, Dongyue, Zhang, Zhenshuo, Duan, Minxuan, Dobriban, Edgar, Zhang, Hongyang R.

arXiv.org Artificial Intelligence | Dec-2-2025

Algorithmic reasoning -- the ability to perform step-by-step logical inference -- has become a core benchmark for evaluating reasoning in graph neural networks (GNNs) and large language models (LLMs). Ideally, one would like to design a single model capable of performing well on multiple algorithmic reasoning tasks simultaneously. However, this is challenging when the execution steps of algorithms differ from one another, causing negative interference when they are trained together. We propose branching neural networks, a principled architecture for multitask algorithmic reasoning. Searching for the optimal $k$-ary tree with $L$ layers over $n$ algorithmic tasks is combinatorial, requiring exploration of up to $k^{nL}$ possible structures. We develop AutoBRANE, an efficient algorithm that reduces this search to $O(nL)$ time by solving a convex relaxation at each layer to approximate an optimal task partition. The method clusters tasks using gradient-based affinity scores and can be used on top of any base model, including GNNs and LLMs. We validate AutoBRANE on a broad suite of graph-algorithmic and text-based reasoning benchmarks. We show that gradient features estimate true task performance within 5% error across four GNNs and four LLMs (up to 34B parameters). On the CLRS benchmark, it outperforms the strongest single multitask GNN by 3.7% and the best baseline by 1.2%, while reducing runtime by 48% and memory usage by 26%. The learned branching structures reveal an intuitively reasonable hierarchical clustering of related algorithms. On three text-based graph reasoning benchmarks, AutoBRANE improves over the best non-branching multitask baseline by 3.2%. Finally, on a large graph dataset with 21M edges and 500 tasks, AutoBRANE achieves a 28% accuracy gain over existing multitask and branching architectures, along with a 4.5$\times$ reduction in runtime.
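To get a feel for the gap between the $k^{nL}$ search space and the $O(nL)$ bound quoted above, the arithmetic can be worked for a small case. The values of k, n, and L below are arbitrary assumptions, and `n * L` is used only as a proxy for the leading term of the bound; constant factors and the per-layer cost of the convex relaxation are ignored.

```python
def exhaustive_structures(k: int, n: int, L: int) -> int:
    """Upper bound on candidate branching structures: k^(n*L)."""
    return k ** (n * L)


def layerwise_steps(n: int, L: int) -> int:
    """Leading term of the O(n*L) bound, with constant factors dropped."""
    return n * L


if __name__ == "__main__":
    k, n, L = 2, 10, 4  # assumed: binary branching, 10 tasks, 4 layers
    print("exhaustive:", exhaustive_structures(k, n, L))
    print("layer-wise:", layerwise_steps(n, L))
```

For this assumed setting the exhaustive count is 2^40 = 1,099,511,627,776, against a leading term of 40 for the layer-wise bound.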

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2512.01113

Country: North America > United States > Pennsylvania (0.28)

Genre:

Workflow (1.00)
Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning

Xu, Haolei, Yan, Yuchen, Shen, Yongliang, Zhang, Wenqi, Hou, Guiyang, Jiang, Shengpei, Song, Kaitao, Lu, Weiming, Xiao, Jun, Zhuang, Yueting

arXiv.org Artificial Intelligence | Dec-1-2025

Large language models (LLMs) have achieved remarkable progress on mathematical tasks through Chain-of-Thought (CoT) reasoning. However, existing mathematical CoT datasets often suffer from Thought Leaps due to experts omitting intermediate steps, which negatively impacts model learning and generalization. We propose the CoT Thought Leap Bridge Task, which aims to automatically detect leaps and generate missing intermediate reasoning steps to restore the completeness and coherence of CoT. To facilitate this, we constructed a specialized training dataset called ScaleQM+, based on the structured ScaleQuestMath dataset, and trained CoT-Bridge to bridge thought leaps. Through comprehensive experiments on mathematical reasoning benchmarks, we demonstrate that models fine-tuned on bridged datasets consistently outperform those trained on original datasets, with improvements of up to +5.87% on NuminaMath. Our approach effectively enhances distilled data (+3.02%) and provides better starting points for reinforcement learning (+3.1%), functioning as a plug-and-play module compatible with existing optimization techniques. Furthermore, CoT-Bridge demonstrates improved generalization to out-of-domain logical reasoning tasks, confirming that enhancing reasoning completeness yields broadly applicable benefits.

arxiv preprint arxiv, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2505.14684

Country: Asia (0.28)

Genre:

Workflow (1.00)
Research Report > New Finding (0.46)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)
